WOLAP: Wavelet-Based Range Aggregate Query Processing
نویسندگان
چکیده
The Discrete Wavelet Transform has emerged as an elegant tool for data analysis queries. It was not until the time we proposed a new wavelet technique, ProPolyne, for fast exact, approximate, or progressive polynomial aggregate query processing that data did not have to be compressed, unlike most of the prior studies in this area. In this paper, after reviewing our ProPolyne technique in more depth with more intuitive and practical discussions, we address its inefficiency in dealing with scientific datasets due to the cube sparseness, subsequently, we propose a new cube model, CFM, to enhance ProPolyne’s both space and query efficiency. While ProPolyne assumed storing the data as large data frequency distribution cubes, CFM organizes the data as a collection of smaller fixed measure cubes to reduce the overall query and storage costs. We combine both cube models in an integrated framework, called WOLAP, for efficient polynomial aggregate query processing. We further enhance WOLAP by proposing practical solutions for real-world deployment in scientific applications. In particular, we show how to incorporate data approximation, how to improve wavelet filter selection, and how to work on datacubes with arbitrary domain sizes.
منابع مشابه
Accuracy Vs Lifetime: Linear Sketches for Appoximate Aggregate Range Queries in Sensor Networks
Query processing in sensor networks is critical for several sensor based monitoring applications and poses several challenging research problems. The in–network aggregation paradigm in sensor networks provides a versatile approach for evaluating simple aggregate queries, in which an aggregation–tree is imposed on the sensor network that is rooted at the base–station and the data gets aggregated...
متن کاملProPolyne: A Fast Wavelet-Based Algorithm for Progressive Evaluation of Polynomial Range-Sum Queries
Many range aggregate queries can be eÆciently derived from a class of fundamental queries: the polynomial range-sums. After demonstrating how any range-sum can be evaluated exactly in the wavelet domain, we introduce a novel pre-aggregation method called ProPolyne to evaluate arbitrary polynomial range-sums progressively. At each step of the computation, ProPolyne makes the best possible wavele...
متن کاملHybrid Query and Data Ordering for Fast and Progressive Range-Aggregate Query Answering
Data analysis systems require range-aggregate query answering of large multidimensional datasets. We provide the necessary framework to build a retrieval system capable of providing fast answers with progressively increasing accuracy in support of range-aggregate queries. In addition, with error forecasting, we provide estimations on the accuracy of the generated approximate results. Our framew...
متن کاملProbabilistic Threshold Range Aggregate Query Processing over Uncertain Data
Large amount of uncertain data is inherent in many novel and important applications such as sensor data analysis and mobile data management. A probabilistic threshold range aggregate (PTRA) query retrieves summarized information about the uncertain objects satisfying a range query, with respect to a given probability threshold. This paper is the first one to address this important type of query...
متن کاملWavelet Disk Placement for E cient Querying of Large Multidimensional Data Sets
New data intensive applications operate on diverse types of data with new charac teristics in querying the data In particular the data set is large and multidimen sional popular examples are spatial and temporal data as well as sensor data streams the queries are complex asking for trends or outliers in data correlation between dif ferent dimensions or aggregation of one or more measure attribu...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007